Visual, Laughter, Applause and Spoken Expression Features for Predicting Engagement Within TED Talks

نویسندگان

Fasih Haider

Fahim A. Salim

Saturnino Luz

Carl Vogel

Owen Conlan

Nick Campbell

چکیده

There is an enormous amount of audio-visual content available on-line in the form of talks and presentations. The prospective users of the content face difficulties in finding the right content for them. However, automatic detection of interesting (engaging vs. non-engaging) content can help users to find the videos according to their preferences. It can also be helpful for a recommendation and personalised video segmentation system. This paper presents a study of engagement based on TED talks (1338 videos) which are rated by on-line viewers (users). It proposes novel models to predict the user’s (on-line viewers) engagement using high-level visual features (camera angles), the audience’s laughter and applause, and the presenter’s speech expressions. The results show that these features contribute towards the prediction of user engagement in these talks. However, finding the engaging speech expressions can also help a system in making summaries of TED Talks (video summarization) and creating feedback to presenters about their speech expressions during talks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

One problem that every presenter faces when delivering a public discourse is how to hold the listeners’ attentions or to keep them involved. Therefore, many studies in conversation analysis work on this issue and suggest qualitatively constructions that can effectively lead to audience’s applause. To investigate these proposals quantitatively, in this study we analyze the transcripts of 2,135 T...

متن کامل

Predicting Audience's Laughter During Presentations Using Convolutional Neural Network

Public speakings play important roles in schools and work places and properly using humor contributes to effective presentations. For the purpose of automatically evaluating speakers’ humor usage, we build a presentation corpus containing humorous utterances based on TED talks. Compared to previous data resources supporting humor recognition research, ours has several advantages, including (a) ...

متن کامل

Audio Hot Spotting And Retrieval Using Multiple Features

This paper reports our on-going efforts to exploit multiple features derived from an audio stream using source material such as broadcast news, teleconferences, and meetings. These features are derived from algorithms including automatic speech recognition, automatic speech indexing, speaker identification, prosodic and audio feature extraction. We describe our research prototype – the Audio Ho...

متن کامل

Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking

Public speaking is an important aspect of human communication and interaction. The majority of computational work on public speaking concentrates on analyzing the spoken content, and the verbal behavior of the speakers. While the success of public speaking largely depends on the content of the talk, and the verbal behavior, non-verbal (visual) cues, such as gestures and physical appearance also...

متن کامل

A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection

This paper presents a study on natural expressive speech during public talks. Specifically, we focus on how people convey important messages that may be retained in the audience’s consciousness. Our study aims to answer several questions. Why are some public speeches memorable and inspirational for the audience, while others are not? Why are some memorable/inspirational spoken quotes more popul...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

Visual, Laughter, Applause and Spoken Expression Features for Predicting Engagement Within TED Talks

نویسندگان

چکیده

منابع مشابه

Fostering User Engagement: Rhetorical Devices for Applause Generation Learnt from TED Talks

Predicting Audience's Laughter During Presentations Using Convolutional Neural Network

Audio Hot Spotting And Retrieval Using Multiple Features

Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking

A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection

عنوان ژورنال:

اشتراک گذاری